Abstract: This paper determines the range of feasible values of
standard error exponents for binary-input memoryless symmetric
channels of fixed capacity $C$ and shows that the extremes are attained
by the binary symmetric channel and the binary erasure channel. The
proof technique also provides analogous extremes for other
quantities related to Gallager's $E_0$ function, such as the cutoff
rate, the Bhattacharyya parameter, and the channel dispersion.



I. Introduction 

In the context of coded communication, the channel coding 
theorem relates the error probability and the code rate, showing 
that there exist codes whose error probability tends to zero 
provided that the code rate is smaller than the channel capacity. 
For uncoded systems, the error probability and the channel 
capacity are also related. In particular, references [1-3] show 
that given one of the two values, tight bounds on the other can 
be given for the family of binary-input memoryless and sym- 
metric (BIMS) channels. Such channels are described by the 
channel transition probability $P_{Y|X}(y|x)$, where $x \in \{x_0, x_1\}$
and $y \in \mathcal{Y}$. We assume that the channel output alphabet
$\mathcal{Y}$ has finite size, though our approach also holds for
well-behaved channels with infinite alphabet size, such as the
binary-input additive white Gaussian noise (BIAWGN) channel. We
adopt Gallager's definition of a symmetric channel [4, p. 94]:
a channel is said to be symmetric if the channel transition
probability matrix (rows corresponding to input values) can be
partitioned into submatrices in each of which every row is a
permutation of any other row and every column is a permutation
of any other column. Both the binary erasure channel (BEC)
and the binary symmetric channel (BSC) are symmetric. 

More precisely, references [1-3] show that the uncoded 
error probability of any BIMS channel with capacity C is 
upper-bounded by that of the BEC and lower-bounded by 
that of the BSC of the same capacity. Similar results have 
been found in [5, 6] for the Bhattacharyya parameter, a simple 
upper bound to the uncoded error probability; here only the
extremal property of the BEC was proved. In the context of
iterative decoding, analogous extremal properties of the BEC 
and BSC have been found [7, 8] for the building blocks of 
iterative decoders for low-density parity-check codes, namely 
variable-node and check-node decoders. 

Upper and lower bounds to the error probability of good 
codes can be given in terms of error exponents, e.g. Gallager's 
random coding bound [4, Thm. 5.6.3], the sphere-packing 
bound by Shannon et al. [9] and Arimoto's strong converse 
bound [10]. These exponents are expressed as optimization 
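problems involving Gallager's $E_0$ function [4, Eq. 5.6.14],

\[ E_0(\rho) \triangleq -\log F(\rho), \tag{1} \]

where

\[ F(\rho) \triangleq \mathbb{E}\left[ \left( \frac{\mathbb{E}\big[ P_{Y|X}(Y|X')^{\frac{1}{1+\rho}} \,\big|\, Y \big]}{P_{Y|X}(Y|X)^{\frac{1}{1+\rho}}} \right)^{\rho} \right] \tag{2} \]

and the pair $(X, Y)$ is distributed according to $P_X P_{Y|X}$. Here
and throughout the paper, $\mathbb{E}[\cdot]$ denotes the expectation of a
random variable and all logarithms are in base 2.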
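As an illustration, the following minimal Python sketch evaluates $F(\rho)$ for a finite-output channel in two ways: through Gallager's sum over outputs and through the expectation in (2). It assumes that $X'$ in (2) is distributed according to $P_X$ independently of $(X, Y)$ and that inputs are equiprobable. The function names and the list-of-rows representation `P[x][y]` are choices made for this sketch only.

```python
import math

def F_sum_form(P, rho):
    """F(rho) = sum_y (sum_x Q(x) P(y|x)^(1/(1+rho)))^(1+rho),
    with equiprobable inputs Q(x) = 1/len(P); P[x][y] = P_{Y|X}(y|x)."""
    nx, s = len(P), 1.0 / (1.0 + rho)
    total = 0.0
    for y in range(len(P[0])):
        inner = sum(P[x][y] ** s for x in range(nx)) / nx
        total += inner ** (1.0 + rho)
    return total

def F_expectation_form(P, rho):
    """F(rho) as the expectation over (X, Y) in Eq. (2)."""
    nx, s = len(P), 1.0 / (1.0 + rho)
    total = 0.0
    for x in range(nx):
        for y in range(len(P[0])):
            if P[x][y] == 0.0:
                continue  # zero-probability pairs do not contribute
            # Conditional expectation over X' ~ Q given Y = y.
            inner = sum(P[xp][y] ** s for xp in range(nx)) / nx
            total += (P[x][y] / nx) * (inner / P[x][y] ** s) ** rho
    return total

def E0(P, rho):
    """Gallager's E_0 function in bits, Eq. (1)."""
    return -math.log2(F_sum_form(P, rho))

bsc = [[0.9, 0.1], [0.1, 0.9]]            # BSC, crossover 0.1
bec = [[0.7, 0.0, 0.3], [0.0, 0.7, 0.3]]  # BEC, erasure 0.3
for name, P in (("BSC", bsc), ("BEC", bec)):
    print(name, F_sum_form(P, 1.0), F_expectation_form(P, 1.0), E0(P, 1.0))
```

Both forms agree because summing $P_X(x) P_{Y|X}(y|x)^{1/(1+\rho)}$ over $x$ reproduces the inner term once more, which raises it to the power $1+\rho$.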

Equiprobable inputs maximize the $E_0$ function for BIMS
channels [11, p. 203], and we henceforth assume this
distribution, i.e., $P_X(x_0) = P_X(x_1) = \frac{1}{2}$.

In this paper, we characterize the feasible values of $E_0(\rho)$
for an arbitrary BIMS channel of fixed channel capacity $C$
and show that the $E_0$ function is upper-bounded (resp.
lower-bounded) by that of the BEC (resp. BSC) of the same capacity.
Since the aforementioned exponents are expressed using the
$E_0$ function, we are able to find their extremal values. In
fact, our analysis leads to similar results for the cutoff rate,
the Bhattacharyya parameter, the channel dispersion, and a
number of other quantities.

II. Feasible Pairs of Capacity $C$ and Function $F(\rho)$

The $F(\rho)$ functions for the BEC and BSC of erasure/crossover
probability $\epsilon$, respectively denoted by $F^{\rm bec}(\rho)$
and $F^{\rm bsc}(\rho)$, are given by

\[ F^{\rm bec}(\rho) \triangleq 2^{-\rho}(1-\epsilon) + \epsilon, \tag{3} \]

\[ F^{\rm bsc}(\rho) \triangleq 2^{-\rho}\left( \epsilon^{\frac{1}{1+\rho}} + (1-\epsilon)^{\frac{1}{1+\rho}} \right)^{1+\rho}. \tag{4} \]

Using the capacity expressions for the BEC, $C^{\rm bec} = 1 - \epsilon$, and
BSC, $C^{\rm bsc} = 1 - h(\epsilon)$, we can find the erasure/crossover
probability corresponding to a given capacity $C$ and parametrize
the functions $F^{\rm bec}(\rho)$ and $F^{\rm bsc}(\rho)$ as functions of $C$, namely

\[ F^{\rm bec}(\rho; C) = 1 + (2^{-\rho} - 1)C, \tag{5} \]

\[ F^{\rm bsc}(\rho; C) = 2^{-\rho}\left( \big(h^{-1}(1-C)\big)^{\frac{1}{1+\rho}} + \big(1 - h^{-1}(1-C)\big)^{\frac{1}{1+\rho}} \right)^{1+\rho}, \tag{6} \]

where $h(p) = -p\log p - (1-p)\log(1-p)$ is the binary
entropy function, and $h^{-1}(x)$ denotes the inverse of $h(p)$ for
$p \in [0, \frac{1}{2}]$. The functions $C^{\rm bec}(F(\rho))$ and $C^{\rm bsc}(F(\rho))$
are respectively defined as the inverses of Eqs. (5) and (6) with respect to $C$.
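The parametrizations (5) and (6) are straightforward to evaluate numerically. The sketch below is one possible implementation; the helper names `h2`, `h2_inv`, `F_bec` and `F_bsc` are hypothetical, and the inverse entropy is computed by plain bisection, which is an implementation choice rather than part of the analysis.

```python
import math

def h2(p):
    """Binary entropy function in bits."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -p * math.log2(p) - (1.0 - p) * math.log2(1.0 - p)

def h2_inv(x):
    """Inverse of h2 restricted to [0, 1/2], by bisection."""
    if x <= 0.0:
        return 0.0
    if x >= 1.0:
        return 0.5
    lo, hi = 0.0, 0.5
    for _ in range(100):  # h2 is increasing on [0, 1/2]
        mid = 0.5 * (lo + hi)
        if h2(mid) < x:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def F_bec(rho, C):
    """Eq. (5): F of the BEC with capacity C."""
    return 1.0 + (2.0 ** (-rho) - 1.0) * C

def F_bsc(rho, C):
    """Eq. (6): F of the BSC with capacity C."""
    eps = h2_inv(1.0 - C)
    s = 1.0 / (1.0 + rho)
    return 2.0 ** (-rho) * (eps ** s + (1.0 - eps) ** s) ** (1.0 + rho)

for rho in (-0.5, 1.0, 3.0):
    print(rho, F_bec(rho, 0.5), F_bsc(rho, 0.5))
```

At $C = 0$ and $C = 1$ both functions take the values $1$ and $2^{-\rho}$, respectively, so the two curves share their endpoints.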

For BIMS channels, one has the bounds $0 \le C \le 1$,
$0 \le F(\rho) \le 1$ for $\rho \ge 0$, and $1 \le F(\rho) \le 2$ for $-1 < \rho \le 0$. This
is a consequence of the facts that $F(\rho)$ is non-negative and
non-increasing for $\rho > -1$ [4, App. 5B], that $\lim_{\rho \to -1} F(\rho) \le 2$,
and that $F(0) = 1$. It is not apparent, however, whether
further restrictions exist on the feasible pairs of capacity $C$ and
$F(\rho)$. Contrary to this first impression, the next theorem tightly
characterizes the set of possible pairs of capacity $C$ and function
$F(\rho)$ for any BIMS channel (see Figure 1). In the next
section, we apply this theorem and prove several analogous
characterizations for other relevant quantities in the analysis
of the error probability over BIMS channels.

Theorem 1: For any BIMS channel with capacity $C$ and
function $F(\rho)$ for $\rho > -1$, the following statements hold:

1) the function $F(\rho)$ of the channel satisfies

\[ F^{\rm bec}(\rho; C) \le F(\rho) \le F^{\rm bsc}(\rho; C); \tag{7} \]

2) the capacity $C$ of the channel satisfies

\[ C^{\rm bsc}(F(\rho)) \le C \le C^{\rm bec}(F(\rho)) \quad \text{for } -1 < \rho < 0, \tag{8} \]

\[ C^{\rm bec}(F(\rho)) \le C \le C^{\rm bsc}(F(\rho)) \quad \text{for } \rho > 0. \tag{9} \]

The extremes in (7)-(9) are attained by the BEC and the BSC.

Furthermore, for a given pair $(C, F(\rho))$ satisfying the
inequalities in (7) or (8)-(9), there exists a BIMS channel with
capacity $C$ and function $F(\rho)$. Conversely, if the inequalities
do not hold for the pair $(C, F(\rho))$, there exists no BIMS
channel with capacity $C$ and function $F(\rho)$.

A. Proof of Theorem 1 

The proof is built around the idea that every BIMS channel 
admits a decomposition into subchannels that are BSCs. This 
decomposition follows directly from Gallager's definition of 
symmetric channels [4, p. 94] as used in this paper. A formal 
description may be found, e.g., in [3, 7]. Here we deem identical
the BEC with erasure probability 1 and the BSC with crossover
probability $\frac{1}{2}$. In this decomposition, each channel output $Y$ is
associated with an index $A = f(Y)$, which is independent of
the input and depends on the channel output only. We denote
by $P_A(a)$ the probability mass or density function of the
subchannel index $a$, and by $\mathcal{Y}(a)$ the corresponding binary output alphabet
of the BSC with index $a$. Assuming such a decomposition,
and since $P_{Y|X}(y|x) = P_{Y|X,A}(y|x,a) P_A(a)$ [3, 7], we have

\begin{align}
F(\rho) &= \mathbb{E}\left[ \left( \frac{\mathbb{E}\big[ P_{Y|X}(Y|X')^{\frac{1}{1+\rho}} \,\big|\, Y \big]}{P_{Y|X}(Y|X)^{\frac{1}{1+\rho}}} \right)^{\rho} \right] \tag{10} \\
&= \mathbb{E}\left[ \mathbb{E}\left[ \left( \frac{\mathbb{E}\big[ P_{Y|X,A}(Y|X',A)^{\frac{1}{1+\rho}} \,\big|\, Y, A \big]}{P_{Y|X,A}(Y|X,A)^{\frac{1}{1+\rho}}} \right)^{\rho} \,\middle|\, A \right] \right] \tag{11} \\
&= \mathbb{E}\big[ F^{\rm bsc}(\rho; C(A)) \big], \tag{12}
\end{align}

where $C(a)$ denotes the capacity of subchannel $a$.

Fig. 1. Region of feasible points $(C, F(\rho))$ for $\rho = -0.99, 1, 10$. The
upper curves correspond to the BSC and the lower straight lines to the BEC.



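The following lemma is proved in Appendix A.

Lemma 1: The function $F^{\rm bsc}(\rho; C)$ is concave in $C \in [0, 1]$
for any $\rho > -1$, non-decreasing for $-1 < \rho \le 0$, and
non-increasing for $\rho \ge 0$.

Noting that $\mathbb{E}[C(A)] = C$, and given the concavity of the
function $F^{\rm bsc}(\rho; C)$, we apply Jensen's inequality to obtain

\begin{align}
F(\rho) &= \mathbb{E}\big[ F^{\rm bsc}(\rho; C(A)) \big] \tag{13} \\
&\le F^{\rm bsc}\big(\rho; \mathbb{E}[C(A)]\big) \tag{14} \\
&= F^{\rm bsc}(\rho; C). \tag{15}
\end{align}

The bound is achieved with equality when the channel is a BSC.

Since $F^{\rm bsc}(\rho; C)$ is concave, we can lower-bound it by
the straight line joining the points $F^{\rm bsc}(\rho; 0)$ (at $C = 0$) and
$F^{\rm bsc}(\rho; 1)$ (at $C = 1$) (see Figure 1), and then evaluate the
expectation, i.e.,

\begin{align}
F^{\rm bsc}(\rho; C) &\ge F^{\rm bsc}(\rho; 0) + C\big( F^{\rm bsc}(\rho; 1) - F^{\rm bsc}(\rho; 0) \big) \tag{16} \\
&= 1 + C(2^{-\rho} - 1) \tag{17} \\
&= F^{\rm bec}(\rho; C). \tag{18}
\end{align}

Applying (16)-(18) to each subchannel and taking the expectation
over $A$ gives $F(\rho) \ge \mathbb{E}[F^{\rm bec}(\rho; C(A))] = F^{\rm bec}(\rho; C)$,
since $F^{\rm bec}(\rho; C)$ is affine in $C$.
This bound is achieved with equality when the channel is a BEC,
thus proving Eq. (7).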
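The two bounding steps can be checked numerically on a small mixture of BSC subchannels. The sketch below is illustrative only: the subchannel capacities and probabilities are arbitrary values chosen for the example, and the helper names are hypothetical.

```python
import math

def h2(p):
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -p * math.log2(p) - (1.0 - p) * math.log2(1.0 - p)

def h2_inv(x):
    """Inverse of the binary entropy on [0, 1/2], by bisection."""
    if x <= 0.0:
        return 0.0
    if x >= 1.0:
        return 0.5
    lo, hi = 0.0, 0.5
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        lo, hi = (mid, hi) if h2(mid) < x else (lo, mid)
    return 0.5 * (lo + hi)

def F_bec(rho, C):
    return 1.0 + (2.0 ** (-rho) - 1.0) * C

def F_bsc(rho, C):
    eps, s = h2_inv(1.0 - C), 1.0 / (1.0 + rho)
    return 2.0 ** (-rho) * (eps ** s + (1.0 - eps) ** s) ** (1.0 + rho)

def F_mixture(rho, caps, probs):
    """Eq. (12): F of a BIMS channel given by BSC subchannels with
    capacities caps[i], selected with probabilities probs[i]."""
    return sum(p * F_bsc(rho, c) for p, c in zip(probs, caps))

caps = [0.1, 0.4, 0.9]   # example subchannel capacities (arbitrary)
probs = [0.2, 0.5, 0.3]  # example subchannel probabilities (arbitrary)
C = sum(p * c for p, c in zip(probs, caps))  # overall capacity E[C(A)]

for rho in (-0.5, 0.5, 1.0, 3.0):
    F = F_mixture(rho, caps, probs)
    print(rho, F_bec(rho, C), F, F_bsc(rho, C))
```

For each value of $\rho$ the middle number lies between the BEC value on its left and the BSC value on its right, as (7) requires.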

Eq. (7) determines the boundaries of the region of feasible
pairs $(C, F(\rho))$. Since $F^{\rm bsc}(\rho; C)$ is concave and $F^{\rm bec}(\rho; C)$
is convex, the region of feasible pairs is convex. Moreover, the
functions $F^{\rm bsc}(\rho; C)$ and $F^{\rm bec}(\rho; C)$ are non-decreasing for
$-1 < \rho \le 0$ and non-increasing for $\rho \ge 0$. Fixing the value
of $F(\rho)$, the convexity of the region implies Eqs. (8)-(9).

We next prove that the region of feasible pairs $(C, F(\rho))$ is
connected by constructing a BIMS channel with corresponding
capacity $C$ and function $F(\rho)$. Consider a binary
symmetric-erasure channel (BSEC) with input alphabet $\{x_0, x_1\}$,
output alphabet $\{y_0, y_1, y_e\}$, cross-over probability $\epsilon_s$ and
erasure probability $\epsilon_e$. Its transition probabilities are given by
$P_{Y|X}(y_0|x_0) = P_{Y|X}(y_1|x_1) = 1 - \epsilon_e - \epsilon_s$,
$P_{Y|X}(y_0|x_1) = P_{Y|X}(y_1|x_0) = \epsilon_s$, and
$P_{Y|X}(y_e|x_0) = P_{Y|X}(y_e|x_1) = \epsilon_e$.
The capacity $C^{\rm bsec}$ and function $F^{\rm bsec}(\rho)$ are respectively

\[ C^{\rm bsec} = (1 - \epsilon_e)\left( 1 - h\!\left( \frac{\epsilon_s}{1 - \epsilon_e} \right) \right), \tag{19} \]

\[ F^{\rm bsec}(\rho) = 2^{-\rho}\left( \epsilon_s^{\frac{1}{1+\rho}} + (1 - \epsilon_s - \epsilon_e)^{\frac{1}{1+\rho}} \right)^{1+\rho} + \epsilon_e. \tag{20} \]

For fixed $C$, there exist several BSECs with capacity
$C$, among them a BSC and a BEC. Each of them is characterized
by a pair of probabilities $\epsilon_s$ and $\epsilon_e$. The corresponding
function $F^{\rm bsec}(\rho)$ is given by Eq. (20). Since the function
$F^{\rm bsec}(\rho)$ is continuous in $\epsilon_s$ and $\epsilon_e$, one can always find a
BSEC with capacity $C$ whose function $F^{\rm bsec}(\rho)$ coincides
with the desired $F(\rho)$.
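The construction can be illustrated by sweeping the erasure probability of a BSEC at fixed capacity. In the sketch below, `bsec_crossover` solves (19) for the cross-over probability given $C$ and $\epsilon_e$; this solution, the helper names, and the chosen grid are assumptions of the sketch.

```python
import math

def h2(p):
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -p * math.log2(p) - (1.0 - p) * math.log2(1.0 - p)

def h2_inv(x):
    """Inverse of the binary entropy on [0, 1/2], by bisection."""
    if x <= 0.0:
        return 0.0
    if x >= 1.0:
        return 0.5
    lo, hi = 0.0, 0.5
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        lo, hi = (mid, hi) if h2(mid) < x else (lo, mid)
    return 0.5 * (lo + hi)

def bsec_crossover(C, eps_e):
    """Cross-over probability of the BSEC with capacity C and erasure
    probability eps_e in [0, 1 - C], obtained by inverting Eq. (19)."""
    keep = 1.0 - eps_e
    return keep * h2_inv(max(0.0, 1.0 - C / keep))

def F_bsec(rho, eps_s, eps_e):
    """Eq. (20)."""
    s = 1.0 / (1.0 + rho)
    return (2.0 ** (-rho)
            * (eps_s ** s + (1.0 - eps_s - eps_e) ** s) ** (1.0 + rho)
            + eps_e)

C, rho = 0.5, 1.0
sweep = []
for k in range(5):
    eps_e = (1.0 - C) * k / 4  # from 0 (a BSC) to 1 - C (a BEC)
    sweep.append(F_bsec(rho, bsec_crossover(C, eps_e), eps_e))
print(sweep)
```

The first entry of `sweep` corresponds to the BSC of capacity $C$ and the last one to the BEC; the intermediate entries fill the interval between the two extremes of (7).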

B. Applications

In the proof of Theorem 1, we exploited the fact that the
region of feasible pairs $(C, F(\rho))$ is convex and connected
to characterize the extreme values of the capacity $C$ or the
function $F(\rho)$. In this section, we apply the theorem to
provide extreme values for other relevant quantities in the error
probability analysis of channel coding. A simple extension to
channel parameters $G$ given by $G(\rho) = g(F(\rho))$, where $g(\cdot)$
is a monotonic continuous function, will prove convenient.

Theorem 2: Let the channel parameter $G$ be given by
$G(\rho) = g(F(\rho))$, where $g(\cdot)$ is a strictly increasing
continuous function. For any BIMS channel, we have that

1) the channel parameter $G(\rho)$ satisfies

\[ g\big(F^{\rm bec}(\rho; C)\big) \le G(\rho) \le g\big(F^{\rm bsc}(\rho; C)\big); \tag{21} \]

2) the channel capacity $C$ satisfies

\[ C^{\rm bsc}(G(\rho)) \le C \le C^{\rm bec}(G(\rho)) \quad \text{for } -1 < \rho < 0, \tag{22} \]

\[ C^{\rm bec}(G(\rho)) \le C \le C^{\rm bsc}(G(\rho)) \quad \text{for } \rho > 0. \tag{23} \]

The inequalities (21)-(23) are reversed if $g(\cdot)$ is strictly
decreasing and continuous.

Gallager's function: By letting $g(x) = -\log(x)$, the
previous theorem readily gives the extremes of Gallager's
function $E_0(\rho) = -\log F(\rho)$ for a fixed capacity, and the
extremes of the capacity for a fixed $E_0$.

Cutoff rate: A particular case of the $E_0$ function is the
cutoff rate, given by $R_0 = E_0(1)$. Thus, the above result also
gives the extremes of the cutoff rate.

Bhattacharyya parameter: A related quantity is the
Bhattacharyya parameter $Z$, given by

\[ Z = \sum_{y \in \mathcal{Y}} \sqrt{P_{Y|X}(y|x_0) P_{Y|X}(y|x_1)} = 2F(1) - 1. \tag{24} \]

The BSC/BEC have the largest/smallest possible Bhattacharyya
parameter for BIMS channels of capacity $C$; interestingly,
these are the reverse of the extremes of the uncoded error
probability [1-3]. This result recovers Sason's [5] and Arıkan's
[6] bound for the BEC, and provides the extreme in the other
direction, attained by the BSC. Fig. 2 shows the bounds to $C$
for a given value of $Z$ from Theorem 2, as well as Arıkan's
generic bounds for binary-input discrete memoryless channels
[6, Eqs. (1), (2)], illustrating some improvement.

Fig. 2. Upper and lower bounds to the capacity $C$ as a function of the
Bhattacharyya parameter $Z$. Arıkan's upper and lower bounds [6, Eqs. (1), (2)]
and the BIAWGN channel curve (dashed line) are also shown for reference.
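For $\rho = 1$ the capacity bounds admit a closed form, which the following sketch evaluates. Setting $F(1) = (1+Z)/2$ in (5) gives $C \ge 1 - Z$ from the BEC, while (4) gives $Z = 2\sqrt{\epsilon(1-\epsilon)}$ for the BSC and hence $C \le 1 - h\big(\tfrac{1}{2}(1 - \sqrt{1 - Z^2})\big)$. These expressions are worked out here from (4), (5) and (24) rather than quoted from the text, and the generic bounds are taken to be $\log\frac{2}{1+Z} \le C \le \sqrt{1 - Z^2}$ as in [6]; function names are hypothetical.

```python
import math

def h2(p):
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -p * math.log2(p) - (1.0 - p) * math.log2(1.0 - p)

def capacity_bounds_bims(Z):
    """Lower (BEC) and upper (BSC) bounds on C for a BIMS channel
    with Bhattacharyya parameter Z, from Theorem 2 with rho = 1."""
    lower = 1.0 - Z
    eps = 0.5 * (1.0 - math.sqrt(1.0 - Z * Z))  # BSC crossover with parameter Z
    upper = 1.0 - h2(eps)
    return lower, upper

def capacity_bounds_generic(Z):
    """Generic bounds for binary-input channels, as assumed from [6]."""
    return math.log2(2.0 / (1.0 + Z)), math.sqrt(1.0 - Z * Z)

for Z in (0.1, 0.3, 0.6, 0.9):
    print(Z, capacity_bounds_bims(Z), capacity_bounds_generic(Z))
```

On this grid the BIMS-specific interval is contained in the generic one, in line with the improvement visible in Fig. 2.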



Random coding exponent: The random coding exponent
$E_r(R)$ [4, Sect. 5.6], given by

\[ E_r(R) = \max_{0 \le \rho \le 1} E_0(\rho) - \rho R, \tag{25} \]

provides an upper bound to the error probability of codes of
rate $R$. This exponent involves a maximization of a function
that, for fixed $\rho$, falls under the conditions for applicability of
Theorems 1 and 2. Therefore, the exponent $E_r(R)$ satisfies

\[ E_r^{\rm bsc}(R; C) \le E_r(R) \le E_r^{\rm bec}(R; C). \tag{26} \]

Figure 3 illustrates the extremes of random-coding error
exponents $E_r(R)$. The random-coding error exponent of an
arbitrary BIMS channel must lie in the shaded area; two such
examples are the BIAWGN channel of the same capacity (with
and without fading).
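The extremes in (26) can be evaluated with a simple grid search over $\rho$. The following sketch does so for a BEC and a BSC of equal capacity; the grid resolution, the helper names, and the use of a grid instead of an exact maximization are choices made for illustration.

```python
import math

def h2(p):
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -p * math.log2(p) - (1.0 - p) * math.log2(1.0 - p)

def h2_inv(x):
    """Inverse of the binary entropy on [0, 1/2], by bisection."""
    if x <= 0.0:
        return 0.0
    if x >= 1.0:
        return 0.5
    lo, hi = 0.0, 0.5
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        lo, hi = (mid, hi) if h2(mid) < x else (lo, mid)
    return 0.5 * (lo + hi)

def E0_bec(rho, C):
    return -math.log2(1.0 + (2.0 ** (-rho) - 1.0) * C)

def E0_bsc(rho, C, eps=None):
    eps = h2_inv(1.0 - C) if eps is None else eps
    s = 1.0 / (1.0 + rho)
    return rho - (1.0 + rho) * math.log2(eps ** s + (1.0 - eps) ** s)

def Er_bec(R, C, steps=1000):
    """Eq. (25) for the BEC, maximized over a grid of rho in [0, 1]."""
    return max(E0_bec(k / steps, C) - (k / steps) * R
               for k in range(steps + 1))

def Er_bsc(R, C, steps=1000):
    """Eq. (25) for the BSC, maximized over the same grid."""
    eps = h2_inv(1.0 - C)  # computed once per capacity
    return max(E0_bsc(k / steps, C, eps) - (k / steps) * R
               for k in range(steps + 1))

C = 0.5
for R in (0.1, 0.3, 0.45):
    print(R, Er_bsc(R, C), Er_bec(R, C))
```

Because the ordering of $E_0$ holds for every $\rho$, it is preserved by the maximization over a common grid, so the BSC value never exceeds the BEC value.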

Expurgated error exponent: For rates below the channel
critical rate, the expurgated error exponent $E_{\rm ex}(R)$ [4, Sect.
5.7], given by

\[ E_{\rm ex}(R) = \max_{\rho \ge 1} E_x(\rho) - \rho R, \tag{27} \]

provides a tighter estimate of the error probability of good
codes than the random-coding exponent. The function $E_x(\rho)$
is expressed in terms of the Bhattacharyya parameter $Z$ as

\[ E_x(\rho) = -\rho \log \frac{1 + Z^{\frac{1}{\rho}}}{2}. \tag{28} \]

Theorem 2 provides the extremes of the expurgated exponent.

Strong converse exponent: In [10], Arimoto
lower-bounded the error probability of block codes at rates above
capacity in terms of the function $E_{\rm sc}(R)$ given by

\[ E_{\rm sc}(R) \triangleq \sup_{-1 < \rho < 0} E_0(\rho) - \rho R. \tag{29} \]

Theorem 2 also provides the extremes of this exponent.

Sphere-packing exponent: The error probability of codes
of rate $R$ is lower-bounded by a bound that depends on the
sphere-packing exponent [9] $E_{\rm sp}(R)$, given by

\[ E_{\rm sp}(R) \triangleq \sup_{\rho \ge 0} E_0(\rho) - \rho R. \tag{30} \]

Again, Theorem 2 provides the extremes of this exponent.

Threshold-decoding error exponents: The exponent of
random-coding bounds based on threshold decoding can also
be expressed in closed form. Shannon [12] derived the
exponent of Feinstein's bound to the error probability [13]. More
generally, the exponent corresponding to a generalized form
of Feinstein's bound [14] can be expressed as

\[ \sup_{\rho \ge 0} \frac{E_0(\rho) - \rho R}{1 + \rho}. \tag{31} \]

Theorem 2 directly gives the error exponent extremes for the
generalized Feinstein's bound.

The exponent of the dependence-testing (DT) bound [15] is [14]

\[ E_r^{\rm dtb}(R) = \max_{0 \le \rho \le 1} E_0(\rho, s = 1) - \rho R, \tag{32} \]

where $E_0(\rho, s) = -\log F(\rho, s)$, for $s > 0$, and

\[ F(\rho, s) \triangleq \mathbb{E}\left[ \left( \frac{\mathbb{E}\big[ P_{Y|X}(Y|X')^{s} \,\big|\, Y \big]}{P_{Y|X}(Y|X)^{s}} \right)^{\rho} \right]. \tag{33} \]

Following similar and somewhat simpler steps to those in the
proof of Lemma 1, one can prove that $F(\rho, s = 1)$, evaluated
for a BSC with capacity $C$, is concave in $C$. Therefore,
Theorem 2 holds and shows that the exponent of the DT bound
has similar extreme values.

Channel dispersion: Recently, the Gaussian approximation
to the error probability $P_e$ of length-$n$ codes at rates
close to the capacity has received renewed attention. In this
approximation, a critical channel parameter is the dispersion
$V$, which for BIMS channels [12, 15] is given by

\[ V = \mathrm{Var}[i(X;Y)] = -E_0''(\rho = 0). \tag{34} \]

Moreover, it can be proved that one can choose either the
$E_0(\rho)$ function or the simpler $E_0(\rho, 1)$ to compute the latter
derivative, that is, $E_0''(\rho = 0) = E_0''(\rho = 0, 1)$. As proved
in Appendix B, the third derivative of $E_0(\rho, 1)$ at $\rho = 0$
is bounded for BIMS channels. Thus, a second-order Taylor
expansion of $E_0(\rho, 1)$ around $\rho = 0$ shows that $E_0''(0, 1)$ has
the same extremes as $E_0(\rho, 1)$. As an illustration, Figure 4 depicts
the possible values of the channel dispersion as a function of the
capacity $C$ of the BIMS channel. The dashed line, which lies
within the shaded area indicating the feasible region of
capacity/dispersion pairs, corresponds to the BIAWGN channel.

Error probability of specific codes: Our theorems may
also be applied to specific codes $\mathcal{C}$ with a given distance
spectrum by means of the Shulman-Feder bound [16] (see
also [17]), given by

\[ -\frac{1}{n} \log P_e(\mathcal{C}) \ge E_r\!\left( R + \frac{1}{n} \log \alpha_{\mathcal{C}} \right), \tag{35} \]

where $\alpha_{\mathcal{C}}$ is a function of the distance spectrum of the code
that quantifies how far the distance spectrum of $\mathcal{C}$ is from that
of the ensemble average.

Exact error probability: One might wonder whether our
extremal results extend to the actual error probability. The
answer is not immediately obvious. For uncoded transmission
(a code of length $n = 1$ and rate $R = 1$) over a given BIMS
channel of capacity $C$, the error probability $P_b$ is upper- and
lower-bounded by that of the BEC and the BSC, respectively:
$h^{-1}(1 - C) \le P_b \le \frac{1}{2}(1 - C)$ [2]. In contrast, the extremes
of the exponential bounds to the error probability, including
the Bhattacharyya parameter, are reversed. This phenomenon
suggests the existence of a pair $(n, R_n)$ such that a crossing
point occurs, in the sense that for rates above (resp. below)
$R_n$ the extremes may be those of uncoded transmission (resp.
the error exponents).

Connection with Arıkan, Telatar [18], and Alsan [19]:
Unpublished work by Arıkan and Telatar [18] uncovered
results of a similar nature to those reported in this paper, showing
that for channels with a fixed rate $R_\rho = \frac{\partial E_0(\rho)}{\partial \rho}$ for $0 \le \rho \le 1$,
the random coding exponent satisfies

\[ E_r^{\rm bec}(R_\rho) \le E_r(R_\rho) \le E_r^{\rm bsc}(R_\rho). \tag{36} \]

For $\rho = 0$ we have that $R_\rho = C$ and we obtain the trivial result
that $0 = E_r^{\rm bec}(R_\rho) \le E_r(R_\rho) \le E_r^{\rm bsc}(R_\rho) = 0$. In contrast,
our results compare channels of a fixed capacity and provide
the extremal values of the random-coding exponent and other
quantities. The suitability of either of these two approaches to
the problem may depend on the specific application. A more
recent result by Alsan [19] recovers both Theorem 1 in this
paper and the results in [18] as particular cases, for BIMS
channels in the interval $0 \le \rho \le 1$.

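Appendix A
Proof of Lemma 1

We aim at proving the concavity of the function

\[ f(C) = \left( \epsilon(C)^{\frac{1}{1+\rho}} + (1 - \epsilon(C))^{\frac{1}{1+\rho}} \right)^{1+\rho}, \tag{37} \]

where $\epsilon(C)$ is itself a function of $C$, namely $\epsilon = h^{-1}(1 - C)$.
Without loss of generality, we limit our attention to the interval
$\epsilon \in [0, 1/2]$. The function is concave if $\frac{d^2 f}{dC^2} \le 0$.
Applying the chain rule of differentiation, we have that

\begin{align}
\frac{df}{dC} &= \frac{df}{d\epsilon} \frac{d\epsilon}{dC}, \tag{38} \\
\frac{d^2 f}{dC^2} &= \frac{d^2 f}{d\epsilon^2} \left( \frac{d\epsilon}{dC} \right)^2 + \frac{df}{d\epsilon} \frac{d^2 \epsilon}{dC^2}. \tag{39}
\end{align}

Direct computation gives

\begin{align}
\frac{df}{d\epsilon} &= \left( \epsilon^{\frac{1}{1+\rho}} + (1-\epsilon)^{\frac{1}{1+\rho}} \right)^{\rho} \left( \epsilon^{-\frac{\rho}{1+\rho}} - (1-\epsilon)^{-\frac{\rho}{1+\rho}} \right), \tag{40} \\
\frac{d^2 f}{d\epsilon^2} &= -\frac{\rho}{1+\rho} \, \frac{\left( \epsilon^{\frac{1}{1+\rho}} + (1-\epsilon)^{\frac{1}{1+\rho}} \right)^{\rho-1}}{\big( \epsilon(1-\epsilon) \big)^{\frac{1+2\rho}{1+\rho}}}. \tag{41}
\end{align}

An application of the inverse function theorem yields

\begin{align}
\frac{d\epsilon}{dC} &= \frac{1}{\log \frac{\epsilon}{1-\epsilon}}, \tag{42} \\
\frac{d^2 \epsilon}{dC^2} &= -\frac{1}{\epsilon(1-\epsilon) \left( \log \frac{\epsilon}{1-\epsilon} \right)^3 \ln 2}. \tag{43}
\end{align}

The derivatives with respect to $C$ are therefore given by

\begin{align}
\frac{df}{dC} &= \frac{\left( \epsilon^{\frac{1}{1+\rho}} + (1-\epsilon)^{\frac{1}{1+\rho}} \right)^{\rho} \left( \epsilon^{-\frac{\rho}{1+\rho}} - (1-\epsilon)^{-\frac{\rho}{1+\rho}} \right)}{\log \frac{\epsilon}{1-\epsilon}}, \tag{44} \\
\frac{d^2 f}{dC^2} &= -\frac{\left( \epsilon^{\frac{1}{1+\rho}} + (1-\epsilon)^{\frac{1}{1+\rho}} \right)^{\rho-1}}{\left( \log \frac{\epsilon}{1-\epsilon} \right)^2 \big( \epsilon(1-\epsilon) \big)^{\frac{1+2\rho}{1+\rho}}} \left[ \frac{\rho}{1+\rho} + \frac{1 - 2\epsilon + \epsilon^{\frac{1}{1+\rho}} (1-\epsilon)^{\frac{\rho}{1+\rho}} - \epsilon^{\frac{\rho}{1+\rho}} (1-\epsilon)^{\frac{1}{1+\rho}}}{\ln \frac{\epsilon}{1-\epsilon}} \right]. \tag{45}
\end{align}

Since we have that $\frac{df}{dC} \ge 0$ for $-1 < \rho \le 0$ and $\frac{df}{dC} \le 0$ for
$\rho \ge 0$, we conclude that $f(C)$ is non-decreasing and non-increasing in
the respective ranges of $\rho$.
The term before the brackets,

\[ -\frac{\left( \epsilon^{\frac{1}{1+\rho}} + (1-\epsilon)^{\frac{1}{1+\rho}} \right)^{\rho-1}}{\left( \log \frac{\epsilon}{1-\epsilon} \right)^2 \big( \epsilon(1-\epsilon) \big)^{\frac{1+2\rho}{1+\rho}}}, \tag{46} \]

is always non-positive for $-1 < \rho < \infty$. Therefore, it suffices
to show that the function

\[ g(\epsilon, \rho) = \frac{\rho}{1+\rho} + \frac{1 - 2\epsilon + \epsilon^{\frac{1}{1+\rho}} (1-\epsilon)^{\frac{\rho}{1+\rho}} - \epsilon^{\frac{\rho}{1+\rho}} (1-\epsilon)^{\frac{1}{1+\rho}}}{\ln \frac{\epsilon}{1-\epsilon}} \tag{47} \]

is non-negative for $\epsilon \in [0, \frac{1}{2}]$ and $-1 < \rho < \infty$.

Let $z = \frac{\epsilon}{1-\epsilon} \in [0, 1]$. With this change of variables we
obtain

\[ g(z, \rho) = \frac{\rho}{1+\rho} + \frac{1 - z + z^{\frac{1}{1+\rho}} - z^{\frac{\rho}{1+\rho}}}{(1+z) \ln z}. \tag{48} \]

We wish to show that $g(z, \rho) \ge 0$. The partial derivative with
respect to $\rho$ is given by

\begin{align}
\frac{\partial g(z, \rho)}{\partial \rho} &= \frac{1}{(1+\rho)^2} - \frac{z^{\frac{1}{1+\rho}} + z^{\frac{\rho}{1+\rho}}}{(1+\rho)^2 (1+z)} \tag{49} \\
&= \frac{g_0(z, \rho)}{(1+\rho)^2 (1+z)}, \tag{50}
\end{align}

where $g_0(z, \rho) \triangleq 1 + z - z^{\frac{1}{1+\rho}} - z^{\frac{\rho}{1+\rho}}$.
We are interested in the sign of $g_0(z, \rho)$, whose derivative is
in turn given by

\[ \frac{\partial g_0(z, \rho)}{\partial \rho} = \frac{\ln z}{(1+\rho)^2} \left( z^{\frac{1}{1+\rho}} - z^{\frac{\rho}{1+\rho}} \right). \tag{51} \]

We readily see that

\begin{align}
\frac{\partial g_0(z, \rho)}{\partial \rho} &\ge 0, \quad \rho \in (-1, 1], \tag{52} \\
\frac{\partial g_0(z, \rho)}{\partial \rho} &\le 0, \quad \rho \in [1, +\infty). \tag{53}
\end{align}

Summarizing, since $g(z, \rho)$ is continuous in $\rho$ for $\rho > -1$,
we have that

• $\frac{\partial g(z, \rho)}{\partial \rho} \le 0$ in $\rho \in (-1, 0]$, since $g_0(z, \rho)$ is
non-decreasing and $g_0(z, \rho) \le g_0(z, 0) = 0$,

• $\frac{\partial g(z, \rho)}{\partial \rho} \ge 0$ in $\rho \in [0, 1]$, since $g_0(z, \rho)$ is non-decreasing
and $g_0(z, \rho) \ge g_0(z, 0) = 0$,

• $\frac{\partial g(z, \rho)}{\partial \rho} \ge 0$ in $\rho \in [1, \infty)$, since $g_0(z, \rho)$ is
non-increasing and $g_0(z, \rho) \ge \lim_{\rho \to \infty} g_0(z, \rho) = 0$.

The fact that $g(z, 0) = 0$ concludes the proof.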
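The key inequality $g(z, \rho) \ge 0$ can be spot-checked numerically. The sketch below evaluates $g$, written as a function of $z$ and $\rho$, on a finite grid; it is a sanity check under the stated grid and tolerance, not a substitute for the proof, and the function name is a choice made here.

```python
import math

def g(z, rho):
    """The function g(z, rho) of the proof, for 0 < z < 1 and rho > -1."""
    a = 1.0 / (1.0 + rho)
    b = rho / (1.0 + rho)
    return b + (1.0 - z + z ** a - z ** b) / ((1.0 + z) * math.log(z))

rhos = (-0.9, -0.5, -0.1, 0.0, 0.5, 1.0, 2.0, 10.0, 100.0)
zs = [k / 100 for k in range(1, 100)]  # z = 0.01, ..., 0.99

smallest = min(g(z, rho) for z in zs for rho in rhos)
print(smallest)
```

A small negative tolerance is advisable when asserting non-negativity, since at $\rho = 0$ the function vanishes identically and floating-point rounding may produce values of either sign close to zero.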



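Appendix B

We wish to prove that the partial derivative
$\left. \frac{\partial^3 E_0(\rho, s = 1)}{\partial \rho^3} \right|_{\rho = 0}$
is bounded. To this end, we first note that the function
$E_0(\rho, s = 1)$ can be expressed as

\[ E_0(\rho, s = 1) = -\log \mathbb{E}\left[ 2^{-\rho \, i(X;Y)} \right], \tag{54} \]

where $i(x; y)$ is the information density, defined as

\[ i(x; y) = \log \frac{P_{Y|X}(y|x)}{P_Y(y)}. \tag{55} \]

The function $E_0(\rho, s = 1)$ is a cumulant generating function.
Its third derivative evaluated at $\rho = 0$ gives the third-order
cumulant, that is, the third-order central moment,

\[ \left. \frac{\partial^3 E_0(\rho, s = 1)}{\partial \rho^3} \right|_{\rho = 0} = \mathbb{E}\left[ \big( i(X;Y) - I(X;Y) \big)^3 \right] (\ln 2)^2. \tag{56} \]

The next result shows that the $k$-th absolute central moment of the
information density is bounded.

Lemma 2: Consider a memoryless channel with discrete
input alphabet $\mathcal{X}$ and arbitrary output alphabet $\mathcal{Y}$. Then, with
equiprobable inputs we have

\[ \mathbb{E}\left[ \big| i(X;Y) - I(X;Y) \big|^k \right] \le \left( 2 \log |\mathcal{X}| + \frac{k}{\ln 2} \left( 1 + |\mathcal{X}|^{\frac{1}{k}} \right) \right)^k. \tag{57} \]

Proof: We will make use of Minkowski's inequality
$\|A + B\|_k \le \|A\|_k + \|B\|_k$, where $\|A\|_k \triangleq (\mathbb{E}[|A|^k])^{\frac{1}{k}}$. Using the
definition of $i(X; Y)$, we now have that

\begin{align}
\| i(X;Y) - I(X;Y) \|_k &\le \left\| \log \frac{P_{Y|X}(Y|X)}{P_Y(Y)} \right\|_k + I(X;Y) \tag{58} \\
&\le 2 \log |\mathcal{X}| + \left\| \log \frac{\sum_{x'} P_{Y|X}(Y|x')}{P_{Y|X}(Y|X)} \right\|_k \tag{59} \\
&\le 2 \log |\mathcal{X}| + \frac{k}{\ln 2} \left\| \left( \frac{\sum_{x'} P_{Y|X}(Y|x')}{P_{Y|X}(Y|X)} \right)^{\frac{1}{k}} - 1 \right\|_k \tag{60} \\
&\le 2 \log |\mathcal{X}| + \frac{k}{\ln 2} \left( 1 + \left\| \left( \frac{\sum_{x'} P_{Y|X}(Y|x')}{P_{Y|X}(Y|X)} \right)^{\frac{1}{k}} \right\|_k \right) \tag{61} \\
&= 2 \log |\mathcal{X}| + \frac{k}{\ln 2} \left( 1 + |\mathcal{X}|^{\frac{1}{k}} \right), \tag{62}
\end{align}

where we have used that $\ln x \le k(x^{\frac{1}{k}} - 1)$ [20, Eq. (4.1.37)].

■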
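As a sanity check of Lemma 2 in the binary-input case, the following sketch compares the $k$-th absolute central moment of the information density of a BSC with the bound $\big(2 \log|\mathcal{X}| + \frac{k}{\ln 2}(1 + |\mathcal{X}|^{1/k})\big)^k$. The choice of the BSC as test channel, the grid of crossover probabilities, and the function names are assumptions of this example.

```python
import math

def h2(p):
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -p * math.log2(p) - (1.0 - p) * math.log2(1.0 - p)

def abs_central_moment_bsc(eps, k):
    """E|i(X;Y) - I(X;Y)|^k for a BSC with crossover eps in (0, 1/2]
    and equiprobable inputs, for which P_Y is uniform."""
    mutual_info = 1.0 - h2(eps)
    # i(x;y) = log2(P(y|x) / (1/2)) takes two values.
    outcomes = [(1.0 - eps, 1.0 + math.log2(1.0 - eps)),
                (eps, 1.0 + math.log2(eps))]
    return sum(p * abs(i - mutual_info) ** k for p, i in outcomes)

def lemma2_bound(k, alphabet_size=2):
    """Right-hand side of Lemma 2."""
    return (2.0 * math.log2(alphabet_size)
            + (k / math.log(2.0)) * (1.0 + alphabet_size ** (1.0 / k))) ** k

for eps in (0.001, 0.01, 0.1, 0.3, 0.5):
    print(eps, abs_central_moment_bsc(eps, 3), lemma2_bound(3))
```

The bound is far from tight for this channel; its role in the argument is only to guarantee that the third derivative of $E_0(\rho, 1)$ at $\rho = 0$ is finite.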

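Using Lemma 2, we have that for BIMS channels,

\[ \left| \left. \frac{\partial^3 E_0(\rho, s = 1)}{\partial \rho^3} \right|_{\rho = 0} \right| \le (\ln 2)^2 \left( 2 + \frac{3}{\ln 2} \left( 1 + 2^{\frac{1}{3}} \right) \right)^3. \tag{63} \]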



References 

[1] M. Hellman and J. Raviv, "Probability of error, equivocation, and the 
Chernoff bound," IEEE Trans. Inf. Theory, vol. 16, no. 4, pp. 368-372, 
Jul. 1970. 

[2] I. Land, Reliability Information in Channel Decoding: Practical Aspects 
and Information Theoretical Bounds, Ph.D. thesis, University of Kiel, 
Germany, 2004. 

[3] I. Land and J. Huber, "Information combining," Foundations and Trends 
in Communications and Information Theory, vol. 3, no. 3, pp. 227-330, 
2006. 

[4] R. G. Gallager, Information Theory and Reliable Communication, John
Wiley & Sons, Inc., New York, NY, USA, 1968.

[5] I. Sason, "On universal properties of capacity-approaching LDPC code
ensembles," IEEE Trans. Inf. Theory, vol. 55, no. 7, pp. 2956-2990,
Jul. 2009.

[6] E. Arıkan, "Channel polarization: A method for constructing capacity-
achieving codes for symmetric binary-input memoryless channels,"
IEEE Trans. Inf. Theory, vol. 55, no. 7, pp. 3051-3073, Jul. 2009.

[7] I. Land, S. Huettinger, P. A. Hoeher, and J. B. Huber, "Bounds on 
information combining," IEEE Trans. Inf. Theory, vol. 51, no. 2, pp. 
612-619, Feb. 2005. 

[8] I. Sutskover, S. Shamai, and J. Ziv, "Extremes of information com- 
bining," IEEE Trans. Inf. Theory, vol. 51, no. 4, pp. 1313-1325, Apr. 
2005. 

[9] C. E. Shannon, R. G. Gallager, and E. R. Berlekamp, "Lower bounds 
to error probability for coding on discrete memoryless channels. I," 
Information and Control, vol. 10, no. 1, pp. 65-103, 1967. 

[10] S. Arimoto, "On the converse to the coding theorem for discrete 
memoryless channels," IEEE Trans. Inf. Theory, vol. 19, no. 3, pp. 
357-359, May 1973. 

[11] F. Jelinek, Probabilistic Information Theory, McGraw-Hill New York, 
1968. 

[12] C. E. Shannon, "Certain results in coding theory for noisy channels," 
Information and Control, vol. 1, no. 1, pp. 6-25, 1957. 

[13] A. Feinstein, "A new basic theorem of information theory," IRE Trans. 
Inf. Theory, vol. 4, no. 4, pp. 2-22, 1954. 

[14] A. Martinez and A. Guillén i Fàbregas, "Random-coding bounds for
threshold decoders: Error exponent and saddlepoint approximation," in
2011 IEEE Int. Symp. Inf. Theory, Saint Petersburg, Russia, Jul.-Aug.
2011.

[15] Y. Polyanskiy, H. V. Poor, and S. Verdú, "Channel coding rate in the
finite blocklength regime," IEEE Trans. Inf. Theory, vol. 56, no. 5, pp.
2307-2359, May 2010.

[16] N. Shulman and M. Feder, "Random coding techniques for nonrandom
codes," IEEE Trans. Inf. Theory, vol. 45, no. 6, pp. 2101-2104, Sept.
1999.

[17] I. Sason and S. Shamai, "Performance analysis of linear codes under
maximum-likelihood decoding: A tutorial," Foundations and Trends
in Communications and Information Theory, vol. 3, no. 1-2, 2006.

[18] E. Arıkan and E. Telatar, "BEC and BSC are $E_0$ extremal," unpublished
technical note, 2008.

[19] M. Alsan, "Extremality properties for Gallager's random coding
exponent," in 2012 IEEE Int. Symp. Inf. Theory, Cambridge, MA, USA, Jul.
2012, pp. 2944-2948.

[20] M. Abramowitz and I. A. Stegun, Handbook of Mathematical Functions,
Dover Publications, 1964.



